Identification Of Outliers In Oxazolines AND Oxazoles High Dimension Molecular Descriptor Dataset Using Principal Component Outlier Detection Algorithm And Comparative Numerical Study Of Other Robust Estimators
نویسندگان
چکیده
From the past decade outlier detection has been in use. Detection of outliers is an emerging topic and is having robust applications in medical sciences and pharmaceutical sciences. Outlier detection is used to detect anomalous behaviour of data. Typical problems in Bioinformatics can be addressed by outlier detection. A computationally fast method for detecting outliers is shown, that is particularly effective in high dimensions. PrCmpOut algorithm make use of simple properties of principal components to detect outliers in the transformed space, leading to significant computational advantages for high dimensional data. This procedure requires considerably less computational time than existing methods for outlier detection. The properties of this estimator (Outlier error rate (FN), Non-Outlier error rate(FP) and computational costs) are analyzed and compared with those of other robust estimators described in the literature through simulation studies. Numerical evidence based Oxazolines and Oxazoles molecular descriptor dataset shows that the proposed method performs well in a variety of situations of practical interest. It is thus a valuable companion to the existing outlier detection methods.
منابع مشابه
Study Of E-Smooth Support Vector Regression And Comparison With E- Support Vector Regression And Potential Support Vector Machines For Prediction For The Antitubercular Activity Of Oxazolines And Oxazoles Derivatives
A new smoothing method for solving ε -support vector regression (ε-SVR), tolerating a small error in fitting a given data sets nonlinearly is proposed in this study. Which is a smooth unconstrained optimization reformulation of the traditional linear programming associated with a ε-insensitive support vector regression. We term this redeveloped problem as ε-smooth support vector regression (ε-S...
متن کاملThe rapid synthesis of oxazolines and their heterogeneous oxidation to oxazoles under flow conditions.
A rapid flow synthesis of oxazolines and their oxidation to the corresponding oxazoles is reported. The oxazolines are prepared at room temperature in a stereospecific manner, with inversion of stereochemistry, from β-hydroxy amides using Deoxo-Fluor®. The corresponding oxazoles can then be obtained via a packed reactor containing commercial manganese dioxide.
متن کاملPerformance Analysis Of Neural Network Models For Oxazolines And Oxazoles Derivatives Descriptor Dataset
Neural networks have been used successfully to a broad range of areas such as business, data mining, drug discovery and biology. In medicine, neural networks have been applied widely in medical diagnosis, detection and evaluation of new drugs and treatment cost estimation. In addition, neural networks have begin practice in data mining strategies for the aim of prediction, knowledge discovery. ...
متن کاملPerformance Analysis Of Regularized Linear Regression Models For Oxazolines And Oxazoles Derivitive Descriptor Dataset
Regularized regression techniques for linear regression have been created the last few ten years to reduce the flaws of ordinary least squares regression with regard to prediction accuracy. In this paper, new methods for using regularized regression in model choice are introduced, and we distinguish the conditions in which regularized regression develops our ability to discriminate models. We a...
متن کاملNon linear Prediction of Antitubercular Activity Of Oxazolines and Oxazoles derivatives Making Use of Compact TS-Fuzzy models Through Clustering with orthogonal least sqaure technique and Fuzzy identification system
The prediction of uncertain and predictive nonlinear systems is an important and challenging problem. Fuzzy logic models are often a good choice to describe such systems, however in many cases these become complex soon. commonlly, too less effort is put into descriptor selection and in the creation of suitable local rules. Moreover, in common no model reduction is applied, while this may analyz...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- CoRR
دوره abs/1312.2861 شماره
صفحات -
تاریخ انتشار 2013